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Abstract 

Matchgates are an especially multiflorous class of two-qubit nearest neighbour quan- 
tum gates, denned by a set of algebraic constraints. They occur for example in the 
theory of perfect matchings of graphs, non-interacting fermions, and one-dimensional 
spin chains. We show that the computational power of circuits of matchgates is equiv- 
alent to that of space-bounded quantum computation with unitary gates, with space 
restricted to being logarithmic in the width of the matchgate circuit. In particular, 
for the conventional setting of polynomial-sized (logarithmic-space generated) families 
of matchgate circuits, known to be classically simulatable, we characterise their power 
as coinciding with polynomial-time and logarithmic-space bounded universal unitary 
quantum computation. 



1 Introduction 

The study of relationships between various kinds of quantum computational resources is 
one of the most fundamental issues in the theory of quantum computation, and one may 
explore a variety of possible avenues. On the one hand we may study the power of quantum 
computations that use a restricted class of gates or computational steps (that are generally 
not fully universal) e.g. computations with Clifford gates [16] or with nearest-neighbour 
matchgates \21 \ I20 |, [12] . (The term "nearest-neighbour" here refers to the requirement that 
the matchgates act only on consecutive pairs of qubits in the 1-dimensional qubit array of 
the circuit). Each of these gate classes is defined by suitable algebraic constraints and in 
these cases they lead to computations that turn out to be classically efficiently simulatable. 
Despite this classical ceiling on computing power, these quantum computational processes 
are still of considerable interest because, for example, both kinds can generate complex 
entangled states and hence illuminate the sometimes alleged blanket attribution of quantum 
computational power to the mere presence of entanglement. 
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A second approach is to allow unrestricted kinds of (suitably local, unitary) quantum 
gates, e.g. they may be freely chosen from some finite universal set, and then place restric- 
tions on the nature and amount of free computational resources that are available for the 
computation. Examples of such resources include time (number of steps), space (number of 
qubits), circuit depth (parallel time), deterministic vs. non-deterministic computation, etc. 
Perhaps the most familiar example of such a restriction is that of polynomially bounded 
time, but others that have been established as conceptually and practically significant in 
the literature of classical complexity theory may also be entertained. 

For our present work the notion of (classical or quantum) space-bounded computation 
will be a fundamental ingredient. We digress here to give a brief intuitive account of 
it using the significant and illustrative case of logarithmic space-bounded computations, 
called log-space computations for short. Let X — X ]_ * * * Xfi be the binary string input for 
a computation. We wish to develop a notion of the computation being carried out in a 
restricted space of size only O(logn). Although this space is too small to even contain the 
input, the notion can be made meaningful and natural as follows. The input is given in an 
area of memory called the input tape, that is read only and cannot be used for computational 
processing. (Use of the term "tape" here is motivated by a more formal definition based 
on the Turing machine model, which is described in the Appendix.) Computation is then 
carried out in a separate area of memory called the work tape, of size O(logn). As the 
computation proceeds, different parts of the input may be read and copied to the work 
tape but only a very small, logarithmic length part may be represented there at any one 
time. Note, however, that numbers from 1 to n can be represented with logn bits, so the 
algorithm can at least remember locations in the input and return to them later if desired. 
A familiar example of the spirit of these definitions is our everyday use of the Internet: we 
may access desired locations, downloading data for processing on our computer's hard disk, 
and remember web addresses, but our hard disk is far too small to simultaneously contain 
the whole Internet. 

The above considerations lead to well-defined notions of log-space computations for both 
classical and quantum computers, which we discuss in greater detail in the Appendix. In 
the classical case (with deterministic computational steps) any log-space computation (if 
it halts) must run in polynomial time \17\ [2]. In a similar vein in the quantum case, any 
log-space quantum computation may be simulated classically in polynomial time: O(logn) 
qubits correspond to 0(poly(n)) dimensions and the progress of the computation of N steps 
may then be classically directly calculated in time 0(iVpoly(n)). Thus, log-space quantum 
computation provides another natural class of classically efficiently simulatable quantum 
computations. 

As mentioned above it is known |214 [20| [T2] that polynomial sized circuits of nearest 
neighbour (n.n.) matchgates can be classically efficiently simulated so that their compu- 
tational power is at most that of classical poly-time computation. A main result of the 
present paper is the precise identification of their power — we show that it coincides with 
the computational power of quantum log-space computation (in which the computational 
steps are unitary operations rather than fully general trace preserving completely positive 
maps \26\ [271 I28|). Thus we obtain an equivalence between a class of computations re- 
stricted by algebraic constraints (n.n. matchgates) on the one hand, and a class obtained 
by limitations on the amount of use of free computational resources (log-space computation 
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with general unitary gates) on the other. Actually we prove a more general result asserting 
an equivalence (in a precisely defined sense) between the computational power of general 
quantum circuits of size (number of gates) M and width (number of qubit lines) m on 
the one hand, and nearest neighbour matchgate circuits of size N = 0(2 2m M) and width 
n = 2 m+1 on the other hand (and then we can set m = O(logn) and M = poly(n) to obtain 
the poly-bounded case mentioned above). 

Our result is similar in spirit to a theorem of Aaronson and Gottesman pQ , who showed 
that the computational power of Clifford (or stabilizer) circuits coincides with that of the 
classical complexity class known as ©L (defined in terms of a certain kind of classical 
nondeterministic log-space computation). In contrast, in our case we have an equivalence 
between two kinds of quantum computations. 

In the next section we will recall basic facts about the classical simulation of nearest- 
neighbour matchgate circuits, extending these results for our later purposes and establishing 
some notations. In Section [3] we give a precise statement of our results with reference 
to an Appendix, which contains the definitions of space-bounded classical and quantum 
computation that we use. In Section 2] we give the proof of our main result, and in Section 
[5] some further concluding remarks. 

2 Quantum matchgate circuits and their classical simulation 

We will follow the notational conventions used in [12] . Let X, Y, Z denote the standard 
qubit Pauli operators. A matchgate is defined to be a two-qubit gate G(A, B) of the form 
(in the computational basis): 

where A and B are both in SU(2) or both in U(2) with the same determinant. Thus the 
action of G(A, B) amounts to A acting in the even parity subspace (spanned by 1 00) and 
|11)) and B acting in the odd parity subspace (spanned by 1 01) and 1 10) ) . 

For any quantum (matchgate or conventional) circuit C, its size N is its total number 
of gates, and its width n is the total number of qubit lines upon which its gates act. We will 
disregard circuits having qubit lines on which no gates act, so that N > n/2 for all circuits 
to be considered. 

A fundamental classical simulation result for matchgate circuits is the following |12] (cf. 

[ZD ED]). 

Theorem 1. Consider any matchgate circuit of size N and width n, such that: 

(i) the matchgates G(A,B) act on nearest neighbour (n.n.) qubit lines only; 

(ii) the input state is any computational basis state \x± ■ ■ ■ x n ); 

(Hi) the output is a final measurement in the computational basis on any single qubit line. 
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Then the output may be classically efficiently simulated. More precisely for any k we can 
classically compute, in poly(iV) time, the expectation value (^fe)out = Po — Pi (and hence 
also po and p\), where po,p\ are the outcome probabilities and Zk is the Pauli Z operator 
on the k-th line. 

As in this theorem, two-qubit matchgates in this paper will always be taken to act only 
on nearest neighbour qubit lines so henceforth the term "matchgate" will mean "nearest 
neighbour matchgate" . 

For later purposes we will need some details of how this classical simulation is actually 
achieved. We begin by introducing the 2n hermitian operators on n-qubits (omitting tensor 
product symbols <S> throughout): 

Cl = XI---I c 3 = ZXI---I ■■■ C2fc_i = Z-ZXI-I ■■■ 

c 2 = YI---I c A = ZYI---I ■■■ c 2k =Z-ZYI-I ■■■ U 

where X and Y are in the k-th slot for c 2 k-i and c 2 k, and k ranges from 1 to n. Thus the 
operators c 2 k-i, c 2 k are associated to the k-th qubit line. It is straightforward to check that 
these matrices satisfy the anti-commutation relations 

{cj, ci} = CjCi + c/Cj = 28j jt I j, I = 1, . . . , 2n. (3) 

These relations define a Clifford algebra C 2n on 2n generators and the operators in eq. ([2]) 
constitute what is known as the Jordan- Wigner representation [9] of the Clifford algebra. 

Next we note the following properties (all proved in [E]). If U is any matchgate acting 
on lines (k, k + 1) then 

U \ c . U= l T.ttk-1 R[3\ l]d for j = 2k — 1, 2k, 2k + 1,2k + 2, 
3 \ Cj for all other j's, 

where R £ <SO(4, M) is a special orthogonal matrix and the /-summation extends over the 
four q operators associated to lines k and k + 1. Furthermore this association of rotations 
to matchgates is surjective, i.e. every R € 50(4, M) arises from a matchgate U (unique 
up to an overall phase). To incorporate all lines k on an equal footing we can regard R 
as a rotation in 2n dimensions acting as the identity outside the 4-dimensional span of 
dimensions 2k — 1, 2k, 2k +1 and 2k + 2. 

Now let 

IVwt) = U |^ in ) = U N ■ ■ ■ Ui \xi ■ ■ ■ x n ) 

represent the action of a circuit of TV matchgates on input \x\ ■ ■ ■ x n ). Then for a final Z 
measurement on line k we have 

PO - pi = {Z k ) = (Vin| U^Z k U IVhx) • (5) 

From eq. ([2]) it holds that 

Zk = —ic 2 k~ic 2 k 

and so 

U^ZkU = -i{U ] c 2k -iU){U^c 2k U). (6) 
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Let Rt be the SO(2n,M) rotation associated to Ut via eq. (jl]) (extended to 2n dimensions), 
and write R = Rn • • • R\- Then from eqs. (|6l5p we get 



2n 

PO ~ Pi = S ^t 2 * " l,i]-R[2*S, /] (V'inl " iCj'Q |^ in ) . (7) 

This finally gives our efficient classical simulation of matchgate circuits by noting the fol- 
lowing features of the formula in eq. (UJ): 

(i) R is a product of N matrices of size 2n x 2n (with n < 27V) so i? can be computed in 
time poly(JV) by sequential matrix multiplication. 

(ii) The Cj operators are all product operators and is a product state of n qubits — so 
{ipin\ CjQ \ipm) can be computed as a product of n factors in time 0(n) = O(N), for 
each of the 0(n 2 ) choices of (j, I). 

(hi) The summation in eq. ((Jj) has only 0(n 2 ) = 0(N 2 ) terms, so in view of (i) and (ii), 
Po ~ Pi can be computed in time poly(iV). In particular, polynomial-sized matchgate 
circuits (i.e. ./V = poly(re)) can be simulated classically in polynomial time. 

To develop a relationship of such circuits to space-bounded quantum computation we 
begin by giving an alternative expression of eq. ([7]). Consider first the case of k = 1 
(i.e. measurement on the first line) and \ipi n ) = 1 - • • 0) . A direct calculation with the 
Jordan- Wigner operators gives 



(00---0| -icjci |00 • • ■ 0> 



1 if (j, I) = (2k' - 1, 2k') 

-1 if (j,l) = (2k',2k'-l) 

-i iij = l (8) 

otherwise. 



Because the rows of the matrix R are orthonormal, the sum of terms in eq. ([7]) with j = I 
vanishes, and we can write 

2n 

(Z 1 ) = ^2R[l t j]R[2,l\S[l,j], (9) 

3,1=1 

where we have introduced the matrix of off-diagonal terms from eq. ([!]): 

1 if = (2k' - 1,2k 1 ) 

-1 i?(j,l) = (2k',2k'-l) (10) 
otherwise 

i.e. S = 0" =1 1" is the block diagonal anti-symmetric matrix with n copies of 

y=>y=(^ I) (") 

on the diagonal. 
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The matrices R and S have size 2n x 2n and real entries, and using the Dirac notation 
|1) , . . . , |2n) for an orthonormal basis in the associated real vector space we can write eq. © 

as 

(Zi) = (2\RSR^ |1) (12) 

(where we have used the fact that the transpose of a rotation matrix is its inverse). Note 
finally that R and S are actually (real) unitary matrices and hence can be viewed as quantum 
operations in 2n dimensions i.e. on O(logn) rebits. Here and henceforth we will use the 
term "rebit" to refer to a qubit whose state components (in the fixed basis being used) are 
restricted to be real numbers. 

The general case of (Zk) and a general input state \ipm) is similar: in eq. (|12p we replace 
(2| and |1) by (2k\ and \2k — 1) respectively and S, depending on the input state, is replaced 
by the real antisymmetric matrix 

S[l,j] = {-0in| - iCjCi |V>in) . (13) 

A computational basis input state \x) = \x\ ■ ■ ■ x n ) gives 

2n 

S(x) = ^(-i^r^i |j - (-i)i> <j| 

which is a "pairwise swap" operation with a conditional phase (— iy +x tt/ 2 ~\ } anc | then 

(Z k (x)) = (2k\ RS{x)R~ 1 \2k - 1) . 

The nearest neighbour restriction on matchgate actions in Theorem [1] is in fact a crucial 
ingredient for the classical simulability of the circuits (as indeed n.n. and next-n.n. actions 
are already universal for quantum computation [E]). Note that n.n. matchgate actions may 
be extended to arbitrary line pairs using the qubit SWAP operation on n.n. lines. However 
SWAP is not a matchgate (even though SWAP = G(I,X) but det(J) ^ det XI) A closely 
related bonafide matchgate is the operation W = G(Z,X), being SWAP together with 
"introduction of a minus sign when both lines are 1" i.e. a fermionic SWAP operation if we 
view qubit labels and 1 as occupation numbers for fermionic modes. For n.n. lines k and 
k+1, the corresponding Jordan- Wigner operator pairs are interchanged under conjugation 
by W viz. W^c 2 k+iW = c 2 k-i and W' ( c 2 k+ 2 W = c 2k and therefore Z k +\W = Z k - 

A circuit C with input \x\ ■ ■ ■ x n ) and measurement on the fe-th line will be called 
equivalent to a circuit D with input \y\ ■ ■ ■ y m ) and measurement on the l-th line, if they 
have the same probability distribution for their output measurements. (Strictly speaking 
we should here include a precision tolerance to allow for change of choice of finite basic gate 
sets, but this technical detail may be readily accommodated and we henceforth ignore it.) 

Using the properties of W above we can standardise the form of matchgate circuits as 
follows, which will be convenient for later purposes. 

Lemma 1. Let C be any matchgate circuit of size N and width n < 2N , with input 
\x\ ■ ■ ■ x n ) and final measurement Zk on the k-th line. Then there is an equivalent matchgate 
circuit D of size N + 0(n 2 ) and width n or n + 1, with input |0 • • • 0) and final measurement 
Z\ on the first line. 
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Proof: The description of D is obtained from C and its input string follows. 
Let r be the Hamming weight of the input string and let k and l k denote length k strings 
of Os and Is, respectively. If r is even we take n lines for D initialised to |0 n ). We then apply 
r/2 G(X, X) n.n. gates to the bottom r/2 lines pairs to obtain |0 n-7 T r ). Next using 0(n 2 ) 
n.n. W = G(Z, X) gates we successively swap the r lower Is into their positions in the input 
string to obtain ± \x\ ■ ■ ■ x n ). Next apply the circuit C. Finally if C has final measurement 
on line k, apply a ladder of k — 1 n.n. W gates to swap the k-th line into the first position, 
and the equivalent final measurement is now Z\ . This completes the description of D when 
r is even. If r is odd we take n + 1 lines for D, applying [r/2] n.n. G(X, X) gates to get 
|0 n_r l r+1 ). We then leave line n + 1 untouched by any further gates and proceed as above 
for the first n lines to obtain D. □ 



3 Main results 

For the remainder of this paper, MG = MG(n; N;x\ ■ ■ ■ x n ;k) will denote a given matchgate 
circuit of width n, size N, input \x±---x n ) and final measurement on the A;-th line. 
Similarly, QC = QC(m; M;yi ■ ■ ■ y m ) will denote a quantum circuit of width m, size M, 
input | j/i • • • y m ) and final measurement (without loss of generality) Z\ on the first line. 

To express our main results we will need the notion of a classical description or encoding 
of a quantum (matchgate or conventional) circuit. We will assume that all our circuits are 
composed of gates from fixed finite sets of one- and two-qubit gates, whose products densely 
generate the class of all gates of the kind being considered. In the proofs given in Section [J] 
below some particularly convenient sets will be chosen but using standard arguments based 
on the Solovay-Kitaev theorem [16] it may be shown that our results do not depend on 
such particular choices. The action of each gate in a quantum circuit of width n and size 
N can be specified by a triple (g,i,j) where g encodes which of the finite gate types it is 
and i,j € {1, • • • , n} specify which of the qubit lines it acts upon. In any circuit we discard 
lines on which no gates act so N is at least n/2. The full circuit is then encoded as a 
concatenation of its sequential gate applications: 

{gi,h,h){92,i2,h) ■ ■ ■ i,9N,iN,3N)- (14) 

All of these symbols may be represented as binary strings (of constant length for the gate 
types g and length log n for the line numbers i and j) giving an encoding of length 0(N log n) 
for any circuit of width n and size N. This encoding may also include specification of an 
input x\ ■ ■ ■ x n and line number k for the final measurement, if desired. In the case of 
matchgate circuits we could omit all the j values since for matchgates we always have 
j t = i t + l for t = l,--. ,N. 

Theorem 2. The following equivalence between matchgate circuits and general quantum 
circuits holds. 

(a) Given any matchgate circuit MG(n; N;xi ■ ■ ■ x n ; k) there exists an equivalent quantum 
circuit QC(m; M; • • • 0) with m = [logn] + 3 and M = 0(N log n). Moreover, the 
encoding of the circuit QC can be computed from the encoding of the matchgate circuit 
MG by means of a (classical) spac$\ O(logra) computation. 

See the Appendix for definitions of space-bounded computation. 
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(b) Conversely, given any quantum circuit QC(m; M; y\ ■ ■ ■ y m ) there exists an equivalent 
matchgate circuit MG(n; N; 0- • • 0; 1) with n = 2 m+1 and N = 0(M2 2m ). Moreover, 
the encoding of the matchgate circuit MG can be computed from the encoding of the 
circuit QC by means of a classical space 0(m) computation. 

Before giving the proof in the next section, we discuss here some consequences. First, 
to illustrate the content of the theorem, consider a polynomial-sized matchgate circuit, 
i.e. one with N = 0(poly(ra)). Then Theorem [2]^ a) states that it can be simulated by a 
circuit QC of poly(n) size and exponentially compressed width O(logn), and the translation 
of descriptions can be carried out with a comparatively modest (log-space) computational 
cost. The latter remark is important in that we would like the simulation of the matchgate 
circuit to be a feature of the QC circuit's computational power rather than being subsumed 
in the computational power of the translation of MG to QC. Conversely, Theorem EJb) 
with parameters m = O(logre) and M = 0(poly(n)) asserts that any poly(ra) sized quantum 
circuit of logarithmic width 0(log n) may be simulated by a matchgate circuit of polynomial 
size (and hence also of polynomial width). In this sense we have a full equivalence between 
the computational power of polynomial-sized matchgate circuits and universal logarithmic- 
width polynomial-sized unitary quantum circuits (modulo classical log-space translations). 

To elevate such observations to formal statements about quantum computational com- 
plexity classes we need to introduce suitably defined families of computational tasks (pa- 
rameterised by increasing input size n) and associated computational complexity classes. 
When characterising computational complexity in terms of the circuit model, we need to 
consider families of circuits subject to a suitable restriction for how these circuits are gener- 
ated as a function of the input and its size n. In contrast, the Turing machine (TM) model 
requires no such auxiliary condition, as a single TM can deal with inputs of all lengths. For 
example, for usual polynomial-time quantum computation we conventionally use families 
of circuits whose descriptions are generated from the input string by a polynomial-time 
classical computation. But the complexity classes that we are aiming to characterise (viz. 
poly-sized matchgate computations and a notion of log-space quantum computation) are 
themselves no stronger than classical polynomial-time computation, so we need to adopt a 
more strict condition. A natural choice is classical log-space generation, commonly used in 
classical complexity theory for classes that are contained within classical polynomial-time 
computation. 

Let S* denote the set of all finite length bit strings. A promise problem is a pair 
A = (A yes ,A no ) of subsets A yes ,A no C S* with ^4 yes n A no = 0. Intuitively speaking, a 
promise problem expresses a computational decision problem, where the strings in the set 
Ay es are those input strings for which the correct answer is "yes" (or the binary value 1) 
and the strings in A no are those for which the correct answer is "no" (or the binary value 0) . 
Input strings contained in neither of the sets ^4 yes and A QO are "don't care" inputs for which 
no answer is required. Promise problems for which Ay CS U^4 no = £*, i.e. that disallow "don't 
care" inputs, are called languages (and are generally represented simply by the set A yes ). 

With this terminology in mind, consider the following definitions. 

Definition 1. Log-space generated families of matchgate circuits and the complexity classes 
BMG, PMG, BQI_u and PQLy are defined as follows. 
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(a) A log-space generated family of matchgate circuits is a classical log-space computable 
function g on X* such that, for each w 6 E*, the string g(w) is an encoding of a 
matchgate circuit MG(n; N; x% ■ ■ ■ x n ; k). 

(b) A promise problem A = (A yDS , A no ) is computed with bounded error by a log-space 
generated family g of matchgate circuits if and only if, for all strings w € A yes the 
matchgate circuit encoded by g(w) outputs 1 with probability at least 2/3, and for 
all strings w £ A no the matchgate circuit encoded by g(w) outputs 1 with probability 
at most 1/3. 

(c) A promise problem A = (A yes , A no ) is computed with unbounded error by a log-space 
generated family g of matchgate circuits if and only if, for all strings w € A yes the 
matchgate circuit encoded by g(w) outputs 1 with probability strictly greater than 
1/2, and for all strings w € A no the matchgate circuit encoded by g(w) outputs 1 with 
probability at most 1/2. 

(d) The class BMG consists of all promise problems computable with bounded error by a 
log-space generated family g of matchgate circuits, and the class PMG consists of all 
promise problems computable with unbounded error log-space generated family g of 
matchgate circuits. 

(e) The class BQLy consists of all promise problems computable with bounded error 
by a log-space unitary quantum computation, and the class PQLy consists of all 
promise problems computable with unbounded error by a log-space unitary quantum 
computation. (These notions are described in the Appendix.) 

It was proved in refs. |26| [27] that, when quantum operations are restricted to those ex- 
pressible by matrices of algebraic numbers, it holds that PQLy = PL, where PL is the 
class of promise problems computed with unbounded error by classical probabilistic Turing 
machines operating in logarithmic space. 

In terms of the above defined notions, Theorem[2]has the following immediate corollaries. 

Corollary 1. Let V = {(Pq ,pf) '■ w € E*} be a family of probability distributions on 
Boolean values, parameter is ed by binary strings w. Then V arises as the output distribution 
for a log-space generated family of matchgate circuits if and only if V arises as the output 
distribution for a unitary log-space quantum computation. 

Proof. The two directions of implication follow easily from Theorem [21 noting that then 
both directions of translation are computable in classical log-space, and using the inter- 
pretation (mentioned in the Appendix, taking f(n) = logn there) of log-space quantum 
computation in terms of log-space generated families of log-width quantum circuits. □ 

Corollary 2. It holds that 

(a) BMG = BQL U; and 

(b) PMG = PQLy. 
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We may alternatively use the notion of completeness of a language or promise problem for 
a complexity class to express the equivalence of the computational power of polynomial-sized 
matchgate circuits and log-space quantum computation. A promise problem A = (A yes , A no ) 
is complete for a complexity class C if A S C and any promise problem B = (B ycs , B no ) in 
C can be reduced to A. There are many differing notions for the reduction of one problem 
to another, but we will consider log-space reductions. The choice to consider log-space 
reductions is analogous to the log-space generation of circuit families, and provides a non- 
trivial notion of completeness for the classes under discussion. 

Definition 2. A function g : S* — s> S* is a log-space reduction from a promise problem 
A = (A yes ,A no ) to another promise problem B = (B yes ,B no ) if g is a (classical) log-space 
computable function and it holds that g(w) G B yes for all w S A ycs , and g(w) G B no for all 
w € A no . 

Thus, a promise problem B is complete for BQLy if B € BQLy and, for every promise 
problem A € BQI_u, there exists a log-space reduction g from A to B. The definition for 
PQLy is similar. 

Now consider the following promise problems that formalize the computational tasks of 
simulating matchgate circuits of the sort that we consider. (These problems are stated in a 
standard way that makes clear which strings belong to the yes- and no-sets.) 

The bounded-error matchgate circuit problem (BMGC for short) 

Input: An encoding of MG(n; N;x±- • • x n ; k), a nearest neighbour quantum matchgate 
circuit, its input string, and its output qubit. 

Yes: The output of MG is 1 with probability p > | . 

No: The output of MG is 1 with probability p < |. 

The unbounded-error matchgate circuit problem (MGC for short) 

Input: An encoding of MG(n; N;xi ■ ■ ■ x n ;k), & nearest neighbour quantum matchgate 
circuit, its input string, and its output qubit. 

Yes: The output of MG is 1 with probability p > ^. 

No: The output of MG is 1 with probability p < ^. 

The equivalence between matchgate circuits and space-bounded quantum computations 
established by Theorem [2] gives the following corollary. 

Corollary 3. With respect to classical log-space reductions, we have 

(a) BMGC is BQLy- complete, and 

(b) MGC is PQl u -complete. 

Thus intuitively, any instance of any problem in BQI_u can be reduced to approximating 
the output of a matchgate circuit having bounded error, and conversely the latter can be 
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computed in BQLy. This corollary is formally similar to the result of [Tj characterising the 
computational power of stabiliser circuits, where it is shown that an analogously defined 
language for stabiliser circuits (rather than our matchgate circuits here) is complete for the 
complexity class 0L (again relative to log-space reductions). 



We will first describe a procedure for converting MG(n; N; x\ ■ ■ ■ x n ; k) into a circuit QC of 
the claimed size and width, and then discuss the implementation of this procedure in space 
O(logn). We will consider first the case where the given matchgate circuit takes the form 
MG(n; N;0 ■ • • 0; 1), and then apply Lemma CD to handle the general case. Without loss of 
generality, it will be assumed that n is a power of 2, and hereafter we will let m = log(re) + 3. 

For clarity of exposition (and without loss of generality) we will assume that all our 
matchgates are drawn from a particular finite set chosen as follows. Recall from eq @ 
that matchgates correspond surjectively with 50(4, R) rotations. Using the standard Euler 
decomposition [8] any such 4-dimensional rotation can be expressed as a product of six ro- 
tations each of which acts nontrivially in one of the six 2-dimensional co-ordinate subspaces 
of M 4 . Accordingly we choose six basic matchgates that correspond to six such rotations, 
through an angle that is an irrational multiple of ir. For example we may simply take the 
rotations to be 



with rotation angle arctan3/4. Then any matchgate (to suitable precision) may be repre- 
sented as a circuit of these six basic matchgates. For convenience (cf Lemma [T]) we can also 
include G(X, X) and G(Z, X) in our basic set. 

Recall (cf eq. (|12p ) that the output probabilities of the matchgate circuit satisfy 



Here R = Rjy ■ ■ ■ R\ is the 2n-dimensional rotation corresponding to the product of the 
individual two-level rotations Rt of the matchgates (gt,it,H + 1) i n t ne circuit MG, and S 
is the 2n-dimensional matrix of eq. (jlOp . The circuit QC will be constructed to implement 
the computation illustrated in Figure 1 for the unitary U = 5" 1 RSR^ 1 . 

The real unitary operation U = S" -1 RSR' 1 acts in 2n dimensions, and for the compu- 
tation described in Figure 1 we need to provide a description of the controlled operation 
AU as a circuit of two-qubit gates on m rebit lines. Each basic rotation Rt corresponding 
to a matchgate (gt,it,H + 1) acts nontrivially in the span of two basis states selected from 
\2it — 1) , \2it) \2it + 1} and \2it + 2), and it acts as the identity in the (2n — 2)-dimensional 
complementary subspace. Our strategy now is the following: we begin by expressing each of 
these basic rotations as a constant sized circuit of multiply-controlled single qubit rotations. 
Then we will use the fact [3] that any r-fold controlled single qubit gate can be implemented 
using 0(r) one- and two-qubit operations, to build up our final circuit of elementary gates, 
of the claimed size. 



4 Proof of Theorem [2] 



4.1 Proof of Theorem [2(a) 




(15) 



po - Pi = (Zi) = (2\RSR^ |1) = {llS^RSBT 1 |1) . 
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Figure 1: The process performed in the translation of MG to QC involves log(2re) + 1 qubit 
lines all initialised to |0) . The middle gate is a controlled-?/ operation AU with the top line as 
control line. H denotes the Hadamard gate. The final Z\ measurement on the first line then 
has the unique binary probability distribution whose mean is {Z\) = Re (0 • • • 0| U |0 • • • 0). 
(Similar processes have been used many times in the quantum computing literature.) 



Unfortunately, if we represent the dimensions \i) as rebit lines via the usual binary 
representation of the labels i = 1, . . . , 2n, then R t will not be a two-qubit gate, and indeed 
it will generally act on all log(2n) rebit lines. To circumvent this problem we instead use a 
Gray code to associate rebit lines to the 2n dimensions. A Gray code [18] for the numbers 
l,...,2n is a sequence of distinct binary strings, each of length log(2ra), such that the 
strings corresponding to any consecutive numbers i and i + 1 have Hamming distance 1. 
Such codes exist and the Gray code strings can be easily computed from the standard binary 
representation via a simple sequence of single bit additions [18] . 

Now, as noted above, each basic matchgate rotation Rt acts within two of four consecu- 
tive dimensions and hence within the space of two Gray code strings of Hamming distance 
at most 3. If we conjugate Rt by suitable X and AX gates that act on the rebit lines where 
the Gray code strings differ, and that are controlled by the Gray code line values where 
they agree, then the Hamming distance may always be reduced to 1, i.e. each Rt (with the 
conjugations in place) can be made to act within the 2-dimensional subspace of a single 
rebit line conditioned on the bit values of all other lines that are specified by the (common) 
Gray code values on those lines. 

As an illustrative example, suppose Rt acts within the 2-dimensional span of the compu- 
tational basis states IOI213I215 • • • z m -2) and llO^Ozs • • • z m -2j having Hamming distance 3, 
where Z3Z5 ■ ■ ■ z m -2 is the fixed substring common to the two Gray code strings. If we apply 
the gate X2, then X4, then (AX)±2, and finally (AA)i4, each conditioned on the values 
Z3Z5 • • • z m -2 of lines 3 and 5 to m — 2, then we obtain the basis states IOOZ3OZ5 • • • z m -2) 
and I IOZ3OZ5 • • • z m -2), respectively, i.e. the rotation action is mapped into the 2-dimensional 
subspace of the first rebit controlled by the values Oz^Oz^ ■ ■ ■ z m -2 on the remaining lines. 

Next we use the algorithm from Section 7.5 of [3] for representing an r-fold controlled 
operation A r T (for a single qubit gate T) in terms of a circuit of 0(r) two-qubit gates, 
which requires the addition of one additional ancillary qubit. Applying this for r < m — 2, 
and where T is our 2-dimensional rotation or one of the extra X operations in the conjuga- 
tions above, we obtain the description of the controlled- R operation (and analogously also 
controlled-i? -1 ) needed for the computation in Figure HTT1 as a circuit of two-qubit gates. 
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Similarly, for the operation S, we see from its block diagonal form eq. (fTUI) that S can be 
represented as a product of operations each of which acts non-trivially only in 2 consecutive 
dimensions. Hence using the same techniques as above we can express S as a circuit of 
2-local gates on rebit lines corresponding to the Gray code. Alternatively we can note that 
eqs. (|10|lip show that in the usual binary representation of the basis state labels, S has the 
local form S = Y\. Then using a simple O(logn) sized circuit |18j that translates between 
the Gray code and usual binary representation, we can get a circuit of size O(logn) for the 
required controlled-,!? operations. 

Assembling these ingredients, we obtain a circuit on m = log(2n)+2 rebit lines described 
in Figure ETT1 for the operation U = S~ 1 RSR~ 1 . If the original matchgate circuit had size 
N and width n then the final circuit has size O(iVlogn) and width m. The O(logn)- 
factor increase in size arises from the algorithm of [3] to decompose an 0(log n)-controlled 
operation into elementary operations. Thus we have achieved the transformation 

MG{n- N; • • • 0; 1) -> QC{m; 0(N log n); • • • 0). 

For the general case where MG takes the form MG(n; N; x\ ■ ■ ■ x n ; k), one may follow 
the same procedure above after first applying the translation process of Lemma [TJ This 
requires the concatenation of sequences of gates to the beginning and end of the encoding 
of the circuit MG, but otherwise does not involve any computation on the gates of MG 
itself. This results in a matchgate circuit of the standard form MG(n; N + 0(n 2 ); • • • 0; 1). 

Finally we point out that the succession of translations described above can be achieved 
by a classical computation bounded to take place in 0(logn) space by noting the following 
facts: 

(i) The gates of MG are processed one at a time to obtain the gates of QC. The com- 
putation requires one pass through the encoding of MG to construct the gates of QC 
corresponding to the operator R, and a second pass in the opposite direction to obtain 
the gates of QC corresponding to R . 

(ii) Each gate of MG is encoded as a string of length O(logn), and it is clear that the 
computation of the gates of QC corresponding to each such gate can be computed 
in space O(logn). In particular, space O(logn) is sufficient to translate any number 
i S {1, . . . , 2n} expressed in binary into its Gray code representation, to compute the 
required conjugations by X and AX gates, and to implement the procedure of [3] on 
the required gates (keeping in mind that the individual two-qubit gates resulting from 
this procedure are output sequentially and need not all be stored simultaneously). 

(iii) The additional gates that are added by the construction of Lemma 1 are easily gen- 
erated one-by-one in space O(logn), and can be incorporated into the procedure 
described above by the addition of O(logra) space. 

This completes the proof Theorem [2]^a). 
4.2 Proof of Theorem [2](b) 

Given a description of a quantum circuit QC(m; M\y\ ■ ■ ■ y m ) we will generate an equiv- 
alent matchgate circuit MG(n = 2 m+1 ;N = 0(2 2m M);x 1 ■ ■ ■ x n = . . . 0; k = 1) and the 
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translation of description will be computable in space 0(m). Without loss of generality we 
may take y\ ■ ■ ■ y m = • • • (as the description of the QC circuit may be easily prefixed by 
X gates corresponding to the given y\ ■ ■ ■ y m values.) 

Let V denote the total unitary operation of the QC circuit. Then its output distribution 
is the unique binary distribution with mean 

<Zi) O cr = <0---0|V+ZiF|0---0). (16) 

On the other hand, for a hypothetical matchgate circuit MG of width n we have 

{Z 1 ) MG = (2\RSR- 1 \1) 

where -R, 5 £ SO(2n, M) arise from the circuit as described in Section [2 If we represent the 
2n dimensions in binary as rebit lines then eq. (|lip shows that S = Y\ and 

(Z 1 ) MG = (2\RY 1 R- 1 \1). (17) 

We will define MG by exploiting the structural similarity between eqs. (116p and (117p . 
Roughly speaking we will associate matchgates to the successive gates of V, choosing them 
so that their adjoint actions as per eq. (j4|) will give an overall rotation that reproduces V* . 
In this metamorphosis of eq. (|16p into eq. (|17p we will need to address three discrepancies 
between these formulae: (i) V is generally complex whereas R is real; (ii) {Z\)mg 1S ex_ 
pressed as an off-diagonal matrix element whereas {Z\)qc is a diagonal element; and (iii) 
{Z\)qc has a central Z\ term whereas {Z\)mg has Y\ in this position. 

We begin by treating (ii) and (iii) together. Introduce the two-qubit controlled operation 

v = !+}(+! ® 1+ \-){-\®Y 

where |±) = 7/g(|0) i are X -basis states. Then 

v j (Yi (g) I 2 )v = Z X ® {-%). (18) 

Next introduce the qubit swap operation SWAP\2 and w\2 = SWAP\2 v. Then eq. (|18p 
gives 

w ] {h ® Y 2 )w = Zi® (-y 2 ). (19) 

Now we extend the QC circuit by introducing an ancillary qubit A (hereafter located in 
the rightmost position in the binary description of the computational basis) and a post- 
processing by w\a (where 1 denotes the first line of the given QC circuit) to obtain V = 
wiaV. Then eq. (|19p (with label 2 in that equation replaced by label A) gives 

{0---01 A \V^Y A V\0---00 A ) = (0---0\V^Z 1 V\0---0)(1 a \-Y a \0a) m) 

= (0---0\V^Z 1 V\0---0) . 1 ' 

Note that if we label our Hilbert space dimensions by natural numbers counting from 1 to 
2 rn+1 then |0 • • • 00^) and |0 • • • 01^) correspond to |1) and |2) respectively and eq. (j20[) gives 

(Z 1 ) QC = (2\V^Y A V\1). 
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Comparing this expression with (Z\)mg m eq. (fTT|) we see that the off-diagonal matrix 
elements as well as the central Y-terms (acting on the first line in both cases) are now in 
correspondence thus addressing issues (ii) and (hi) above. Furthermore the translation of 
the QC circuit description for V into that for V (i.e. adjoining a description of w\a) is a 
simple process that can be carried out even in constant space. 

To deal with issue (i) recall that real gates suffice for universal quantum computation 
[5] and given any quantum circuit there is an equivalent circuit comprising real gates from 
the special orthogonal group that has one extra ancillary rebit line B. This construction is 
a direct consequence of the algebra isomorphism 

a + ib H> al - bY = 

between complex numbers and a class of real 2x2 matrices. Correspondingly to any K-qubit 
unitary gate U € U(2 ) we associate the real SO(2 K+1 , R) gate 

U = Re(U) ®I B - hn.(U) <g> Y B 

and to any if-qubit state we associate the (K + l)-rebit state \ip) = Re(|^)) <S> \0) B + 
Im(|^)) (g> \1) B to map any quantum computation into an equivalent real one. 

If the gates U of the QC circuit are from a universal set of one- and two-qubit gates 
then the corresponding C/'s will be one-, two-, and three-rebit gates and we may assume 
that they are given already decomposed in terms of a finite real universal set of two-rebit 
gates. For this set we choose the six rotations through angle arctan3/4 (cf eq. f)15[) ) that act 
respectively in the six 2-dimensional co-ordinate subspaces of R . In this way we translate 
the original QC circuit into an equivalent circuit of width m + 2 comprising these basic 
real gates and the translation can be achieved in 0(m) space, for example by sequentially 
translating each successive gate of V into its real version. 

Each of the resulting basic gates is a rotation T^ y acting on some (not necessarily 
nearest neighbour) pair (k,k') of rebit lines. Let r be the associated «SO(4, M) matrix, 
which by our choice of basic gates, acts non-trivially only in a 2-dimensional subspace of 
R 4 . To associate a circuit of matchgates on n = 2 m+1 qubit lines we need to view the global 
gate T = I*. # Ij as an operator on the full In = 2 m+2 -dimensional space with basis 

labelled sequentially as |1) , . . . , \2n). By virtue of its tensor product structure the 2n x 2n 
matrix of T amounts to the application of n/2 parallel instances of r, acting in a set of n/2 
disjoint 4-dimensional co-ordinate subspaces. Furthermore r acts non-trivially in only 2 of 
its 4 dimensions so T = T\T2 • • • T n / 2 is then given as a product of n/2 rotations Tj, in 2n 
dimensions, each of which acts non-trivially only in the span of two basis states \ki) and \li) 
(and acts as the identity in the 2n — 2 dimensional complement). Now if \ki — li\ < 3 then 
the rotation Tj is induced directly by a single matchgate Gi via eq. ©. Here Gi is one of 
six basic matchgates that correspond to our choice of the six basic rotations. If \ki — li\ > 3 
recall that the matchgate G(Z, X) on qubit lines k, k + 1 induces an associated rotation 
that swaps the pairs (2k — 1, 2k) and (2k + 1, 2k + 2) with each other. Then the rotation 
T{ is induced by a basic matchgate Gi conjugated by a ladder of 0(n) G(Z,X) matchgates 
(to bring (ki,li) into a position with \ki — li\ < 3). Thus each T, corresponds to a circuit 
of 0(n) matchgates so T (comprising 0(n) Tj's) corresponds to a matchgate circuit of size 
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0(n 2 ). Then finally, our original QC circuit of width m and size M is translated into an 
equivalent matchgate circuit of width 2 m+1 and size 0(Mn 2 ) = 0(M2 2m ) as claimed. Note 
that in the final translation, the renaming of the six basic rotations by their corresponding 
matchgates is a simple operation but the corresponding line labels for the matchgate actions 
range from 1 to n = 2 m+1 needing 0(m)-bit specifications, which can be calculated in 0(m) 
space using standard arithmetic operations. This completes the proof of Theorem EJb). 

5 Concluding remarks 

We have demonstrated an equivalence (in the precise sense given in Section [3]) between the 
computational power of matchgate circuits of width n and universal unitary quantum com- 
putation within an exponentially compressed space bound of O(logra) qubits. This equiv- 
alence is particularly interesting in the case of classically efficiently simulatable quantum 
computations (cf Theorem [1]) where the matchgate circuits (and then also the correspond- 
ing log-space bounded circuits) have poly(ra) size. In the literature there are two seemingly 
different techniques for constructing classically simulatable quantum computations: (a) we 
have the normaliser formalism that underlies the simulability of stabiliser circuits, as ex- 
pressed in the Gottesman-Knill theorem [7]. Generalisations of this technique have been 
considered in [6] and it has been argued in [10] that the classical simulability of matchgate 
circuits in Theorem [1] can be understood in terms of a suitably generalised normaliser con- 
struction too. Characteristically this technique leads to circuits of gates that are subject 
to algebraic constraints (normaliser conditions) and despite their classical simulability, the 
associated computations can generate complex entangled states, (b) A second method for 
imposing classical simulability is to restrict the amount of entanglement in the states that 
may be generated in the course of the computation. Such states have correspondingly small 
classical descriptions which can then be exploited for classical simulation. Examples of this 
method include the near-separable computations of [11[ [25] and use of the matrix product 
formalism (cf [24] for a comprehensive recent review) to impose bounds on the Schmidt 
rank across any partition of a multi-qubit state. 

With the above in mind, our main results can be seen as providing a link between 
(a) and (b) showing that, at least for the case of matchgate computations, the associated 
normaliser formalism may be viewed as an example of a limitation on the amount of generic 
entanglement and vice versa. 

Our results (especially Corollary [2|) provide alternative characterisations of some classi- 
cally simulatable quantum complexity classes. Such classes, ipso facto contained within P 
(classical polynomial time computation), have the following interesting feature: suppose we 
have any class C of classically simulatable quantum computations (that is not just classical 
computation itself i.e. we have a nontrivial quantum gate involved). Then we can argue that 
if the computational power of C is full classical polynomial time (in a suitably strong sense, 
as below), it would follow that BPP equals BQP i.e. that the powers of polynomial time 
classical and quantum (bounded error) computing coincide. Thus it is significant that our 
poly-sized circuits of matchgates should have a computational power that is strictly weaker 
than P (and similarly for stabiliser circuits). The intuitive argument is the following: we in- 
terpret the condition that C has full classical poly-time power in a strong sense, as meaning 
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that any classical gate can be simulated (in a quantum-coherent fashion) by a circuit from 
C. Thus we can simulate a Toffoli gate. But it is known [19] that the Toffoli gate plus any 
nontrivial (basis-changing) quantum gate is efficiently universal for quantum computation. 
Hence if C also has any such nontrivial quantum gate, then in addition to being classically 
simulatable, C must also be quantum universal. Note that the above argument does not 
apply to some possible classes of classically simulatable computations such as those in [22] , 
that can involve further global restrictions on the structure of allowed circuits e.g. although 
it may be possible to simulate a Toffoli gate, the definition of the class may yet exclude its 
appearance in a general position within a circuit. 

Finally we mention some possible avenues of significance of our results for further phys- 
ical and implementational considerations. 

Matchgate circuits are known to include the real-time dynamics of one-dimensional 
XY spin chain Hamiltonians with nearest neighbor interactions (cf. e.g., [201 [12]). Thus 
Theorem [2]Ja) implies that a digital quantum simulation of the latter can be carried out 
employing a quantum computer which is exponentially smaller in its use of qubits than the 
original system. This may for example, allow for an experimental observation of a quantum 
phase transition induced by varying the parameters in the Hamiltonian (cf. [23]). 

If there exist experimental situations in which matchgates are especially easy to realise, 
then the simulation of universal quantum computation using matchgates, that is implied 
by Theorem EJb), may be of experimental interest to demonstrate quantum computational 
effects, albeit with an exponential overhead in circuit width. Recalling that matchgates 
correspond to unitary evolutions generated by quadratic Hamiltonians of fermionic creation 
and annihilation operators |20t ll2|. a prospective setting here might be a fermionic counter- 
part [4] of the Knill-Laflamme-Milburn scheme in linear optics, where the gates are supposed 
to be implemented by beam splitters using the point contacts of a two-dimensional electron 
gas. 

Lastly we recall that the symmetry of the special orthogonal group appeared abstractly 
in our developments, via the Clifford algebra formalism (cf eq. H|). However it can also be 
emergent physically as exotic quantum statistics. For example, in the fractional quantum 
Hall effect with the filling fraction v = 5/2 or in recent topological insulator-superconductor 
structures, the quasi-particle excitations in the celebrated Moore- Read state behave as non- 
abelian (Ising) anyons [2], that can realise non-abelian quantum statistics of SO(2n,M.) 
for 2n excitations [15], analogous to the transformations available in the matchgate for- 
malism. The experimental observation of such anyonic behaviour is of current interest for 
condensed-matter physics, as well as for a potential implementation of topological quan- 
tum computation, where the anyon braiding operations would need to be supplemented by 
further non-topological operations to achieve universal quantum computation in the full 
Hilbert space |13j . However Theorem EJb) implies that the SO(2n,M) action has an inher- 
ent universal computational capability, albeit within an exponentially compressed quantum 
space bound, that can then also be viewed in terms of matchgate computations. 
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A Appendix: Definitions of space-bounded computation 



The notion of classical space-bounded computation that we use is the standard textbook 
one (cf \17\ [2]). and for completeness we include a summary description here. The classical 
Turing machines (TMs) that we consider have three tapes called the input, work, and output 
tapes, respectively. Each tape is a sequence of cells which may contain one of three symbols: 
the blank symbol B, or one of the binary values or 1. (Often one allows more symbols, 
but these ones are sufficient for the present discussion.) Conventionally we take the tapes 
to be infinite to the right. 

At any stage the TM has an internal state q, chosen from a fixed finite set, and each 
tape has a tape head that scans one of its cells at each moment. The computational process 
of the TM is defined by a finite table of transition rules of the form 

(Q) •Sinputi ^work) ^ { r i ^workj ^output j ^input > G^work) (^1) 

where q is an internal state and Si nput , s wor k are symbols on the input and work tape, 
respectively. Each d is either L ("one step left"), R ("one step right") or S ("stay in the 
same place"). The meaning of this instruction is the following: if the TM has the internal 
state q, and is scanning the symbols s; npu t and s wor k on its input and work tapes, then 

(i) the internal state changes to r, 

(ii) the input tape head moves in direction di npu t, 

(iii) the symbol being scanned on the work tape is replaced by i wor k and the work tape 
head moves in direction d WOT ^, and 

(iv) if Output is a non-blank symbol, then this symbol is written on the output tape and 
the output tape head moves right (while nothing happens to the output tape or tape 
head if toutput is a blank symbol). 

These rules imply that the input tape is "read only" and the output tape is "write only". 
This is done so that the work tape effectively represents the memory of the Turing machine, 
upon which various limitations can be defined. It is also common to impose the restriction 
that if Si n p U t is the blank symbol then di np ut cannot be R, which forces the input tape head 
to remain at most one square away from the actual input string on the input tape. (If 
this condition is not imposed, the position of the input tape head could be exploited to 
effectively store a large integer value that is not counted as part of the Turing machine's 
memory usage.) We assume the machine to be deterministic, which means that there can 
be only one transition rule whose left-hand-side is given by any triple (q, s; npu t, s W ork)- 

To say that a TM is run on a particular binary input string w means that the TM 
is started in a distinguished starting state (/start > has w written left-justified on the input 
tape, one symbol per cell and with all other tape cells containing the blank symbol, and 
with all tape heads scanning the leftmost cells of their respective tapes. The transition 
rules then determine the steps of the computation. One of the TM's internal states qh&it is 
designated as the halting state, and the computation is assumed to stop in the event that 
this state is reached. If this happens, the binary string written on the output tape is taken 
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as the TM's output. A function g : X* — > X* is computed by the TM if, for every input 
string x € £*, the TM eventually halts and produces the output g(x). The function g is 
said to be computable in space bounded by f(n) if, for all inputs X — X j_ ' Xfif the work 
tape head remains within the first f(n) cells of the work tape throughout the computation. 
For decision problems, where the answer for each X 1 * * X fi IS or 1, one often disregards 
the output tape entirely and takes the first cell of the work tape, say, to hold the answer 
whenever the computation halts. 

Of particular interest to this paper are functions computable within space bounds of 
the form f(n) = O(logn), which is significantly smaller than the space required to hold 
the entire input. Functions that are computable within such a bound are called log-space 
computable functions. Note that a function g computable in log-space need not have cor- 
respondingly short outputs yi • • • y m = g(x\ ■ ■ ■ x n ), because a log-space bounded machine 
may still run for 0(poly(n)) steps |17^ [2] (albeit in cramped space conditions) before halting, 
and so we can potentially have m = 0(poly(n)) as well. 

A general notion and study of space-bounded quantum computation was given in [261 [27] . 
Here we will adopt a simplified version of the model along the lines presented in [28]. 
Although one may seek to directly generalise the classical definition above, using a fully 
quantum notion of a quantum Turing machine of the kind described in [5], it appears to 
be much easier to work instead with a classical-quantum hybrid TM model in which some 
aspects of the machine are required to always remain classical. 

We start with a classical TM as above, and add an additional tape (called the quantum 
work tape) with the following features: 

(i) the cells of the quantum work tape are qubits, each of which is initialised to |0) at the 
start of the computation, and 

(ii) there are two heads, with classical positions, scanning the quantum work tape. 

We will also eliminate the output tape as our focus will be on decision problems, although 
one could make further modifications to the model to allow for a classical output tape as 
well. 

The operation of the machine is defined by a finite set of instructions similar to eq. (|2ip . 
but suitably modified so that quantum gates may be applied to the qubits on the quantum 
tape during the course of the computation. In particular, we may assume that each internal 
state q has associated to it a two-qubit quantum gate U q , chosen from a universal set 
(which should include the identity operation for the programmer's convenience). Then a 
basic transition rule for the machine has the form: 

(OS S' m , S wor k) y (T,t wor k, di Q p U t, dwQj-fc, dql, dq2*)- (^2) 

The meanings of such a transition is similar to the ones described previously, except that 
d q i and d q 2 denote movement instructions for the two quantum work tape heads. The 
computation proceeds in a similar way as before — but with the addition that if the two 
quantum work tape heads are scanning different squares at the start of a given step, then 
the two-qubit operation U q is applied to the two scanned qubits before the tape heads move. 
In the event that the machine enters the state q a &\t, the leftmost quantum work tape qubit 
is measured in the computational basis to determine the output of the computation. 
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The quantum computation is said to occur within space bound f(n) if, for every input 
x\ ■ ■ ■ x n given on the input tape, the work tape head and qubit tape heads remain within 
the first /(n) cells of their respective tapes throughout the computation. Because our 
output is a single bit, this provides a notion of a decision problem being computed by an 
/(n)-space-bounded quantum computation. 

It is important to point out that a space f{n) bounded quantum computation may 
be alternatively thought of as just a circuit of quantum gates applied to /(n) qubit lines, 
all initialised to |0), where the circuit is determined by a classical space f(n) bounded 
computation on the input x\ ■ ■ ■ x n , i.e. a "space-/(n) generated" family of quantum circuits. 
Indeed, in our hybrid classical-quantum TM, the quantum work tape may be replaced by a 
classical output tape just as in the fully classical model, and instead of applying quantum 
operations, the corresponding gate names and line numbers are sequentially written on the 
output tape. 

Note that apart from the final measurement, all our quantum operations are required to 
be unitary gates. One may entertain a more general scenario in which general non-unitary 
quantum operations (completely positive trace preserving maps) on two qubits are allowed, 
as well as a mechanism for measurements of qubits to occur during the computation that 
can influence the classical parts of the machine. In the ubiquitous scenario of polynomial 
time and polynomial width quantum computations, say in terms of polynomial-sized quan- 
tum circuits, this provides no further generality because arbitrary quantum operations may 
always be represented via unitary operations with the inclusion of extra ancillary qubit 
lines. However, for space-bounded computation this generalisation can be non-trivial. For 
example, a log-space computation with general non-unitary gates could involve polyno- 
mially many steps, and to simulate such a computation by a unitary process in the most 
straightforward way would hence require a polynomial number of ancillary qubit lines, which 
are not available in log-space. The general notion of space-bounded quantum computation 
with general non-unitary gates has been considered in [26\ I27j . but for the purpose of the 
present paper we restrict ourselves to the model with unitary gates, which we accordingly 
call space-bounded unitary quantum computation. 
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